Abstract. We apply algorithmic information theory to quantum mechanics in order
to shed light on an algorithmic structure which inheres in quantum mechanics. 

There are two equivalent ways to define the (classical) Kolmogorov complexity
K(s) of a given classical finite binary string s. In the standard way, K(s) is defined
as the length of the shortest input string for the universal self-delimiting Turing
machine to output s. In the other way, we first introduce the so-called universal
probability m, and then define K(s) as $-\log_2 m(s)$ without using the concept of
program-size. We generalize the universal probability to a matrix-valued function,
and identify this function with a POVM (positive operator-valued measure). On the
basis of this identification, we study a computable POVM measurement with countable
measurement outcomes performed upon a finite dimensional quantum system.
We show that, up to a multiplicative constant, $2^{-K(s)}$ is the upper bound for the
probability of each measurement outcome s in such a POVM measurement. In what
follows, the upper bound $2^{-K(s)}$ is shown to be optimal in a certain sense.



Key words: algorithmic information theory, universal probability, POVM, com- 
1 Introduction



Algorithmic information theory is a theory of program-size complexity which has precisely the
formal properties of classical information theory. In algorithmic information theory, the
program-size complexity (or Kolmogorov complexity) K(s) of a finite binary string s is defined as
the length of the shortest binary input for the universal self-delimiting Turing machine to output s.
The concept of program-size complexity plays an important role in characterizing the randomness
of a finite or infinite binary string. In this paper we extend algorithmic information theory to
the quantum regime in order to throw light upon an algorithmic feature of quantum mechanics. We
show that Kolmogorov complexity gives the upper bound for the probability of each measurement
outcome in a computable POVM measurement with countable outcomes performed upon a finite
dimensional quantum system.
1.1 Main result



In this paper, we consider a quantum measurement performed upon a finite dimensional quantum
system. A positive operator-valued measure (POVM) is a collection {E(m)} of positive
semi-definite Hermitian matrices which satisfies $\sum_m E(m) = I$, where I is the identity matrix.
Each E(m) is called a POVM element of this POVM. In general, the statistics of outcomes in
a quantum measurement are described by a POVM {E(m)}. The label m refers to the measurement
outcomes that may occur in the experiment. If the state of the quantum system is
described by a normalized vector $|\psi\rangle$ immediately before the measurement, then the probability
that result m occurs is given by $\langle\psi|E(m)|\psi\rangle$. On the other hand, if the ensemble of the states
of the quantum system is described by a density matrix $\rho$ immediately before the measurement,
then the probability that result m occurs is given by $\mathrm{tr}(\rho E(m))$. A POVM measurement is a
generalization of a projective measurement which is described by an observable. The number of
outcomes in a POVM measurement can be more than the dimension of the state space of the
quantum system being measured, whereas the number of outcomes in a projective measurement
cannot. In this paper, we relate an argument s of K(s) to an outcome which may occur in
the quantum measurement performed upon a finite dimensional quantum system. Since K(s)
is defined for all finite binary strings s, countably many outcomes have to be available in the
corresponding quantum measurement. Thus we deal with a POVM measurement and not a
projective measurement. (See e.g. [11, 12] for the details of POVM measurements.)
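As a concrete illustration of the rule $\mathrm{tr}(\rho E(m))$, the following Python sketch evaluates the outcome probabilities of a three-outcome POVM on a two-dimensional system. The choice of the so-called trine POVM, the state $|0\rangle\langle 0|$, and all function names are assumptions made here for illustration; they do not appear in the paper.

```python
import math

def outer(v):
    """Return the matrix |v><v| for a vector v given as a sequence of numbers."""
    return [[a * complex(b).conjugate() for b in v] for a in v]

def matmul(a, b):
    """Multiply two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def trace(a):
    """Sum of the diagonal entries."""
    return sum(a[i][i] for i in range(len(a)))

def trine_povm():
    """Three POVM elements E(k) = (2/3)|phi_k><phi_k| on a qubit (N = 2)."""
    elements = []
    for k in range(3):
        phi = (math.cos(k * math.pi / 3), math.sin(k * math.pi / 3))
        elements.append([[2 / 3 * x for x in row] for row in outer(phi)])
    return elements

def outcome_probabilities(rho, povm):
    """Probability tr(rho E(m)) of each outcome m."""
    return [trace(matmul(rho, e)).real for e in povm]

rho = [[1.0, 0.0], [0.0, 0.0]]  # the pure state |0><0| as a density matrix
probabilities = outcome_probabilities(rho, trine_povm())
```

The three probabilities sum to one although the state space has dimension two, which is the situation a projective measurement cannot provide.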

We say a POVM is computable if one can compute all its POVM elements to any desired
degree of precision, and a POVM measurement is said to be computable if it is described by a
computable POVM. Our main result is as follows: Let {R(s)} be a computable POVM on a
finite dimensional quantum system, each element of which is labeled by a finite binary string. Then
there exists an integer d such that, for every density matrix $\rho$ and every finite binary string s,

$K(s) - d \le -\log_2 \mathrm{tr}(\rho R(s))$, (1)

and also there exists a real number c > 0 such that, for every density matrix $\rho$ and every finite
binary string s,

$\mathrm{tr}(\rho R(s)) \le c P(s)$. (2)

Here P(s) is the probability that the (classical) universal self-delimiting Turing machine halts
and outputs s when it starts on the program tape filled with an infinite binary string generated
by infinitely repeated tosses of a fair coin.

The inequality (1) states that, up to an additive constant, K(s) is the lower bound for the
$-\log_2$ of the probability of each measurement outcome s in a computable POVM measurement
with countable outcomes performed upon a finite dimensional quantum system, i.e., $2^{-K(s)}$ is
the upper bound for the probability of each outcome s up to a multiplicative constant. On the
other hand, the inequality (2) states that, up to a multiplicative constant, P(s) is the upper
bound for the probability of each measurement outcome s in the same measurement. Note that
the inequalities (1) and (2) are equivalent to each other, since $K(s) = -\log_2 P(s) + O(1)$.

The computability of a POVM measurement is thought to be intrinsic in the case where one
performs the measurement in order to extract valuable information from a quantum system,
because in such a case one has to be able to compute to any desired degree of precision all POVM
elements of the POVM which describes the measurement. Hence, when one wants to extract
valuable information from a finite dimensional quantum system through a POVM measurement
with countable outcomes, one faces the limitation given by the inequality (1) (equivalently
by (2)).

The inequality (2) is especially interesting. Since P(s) is a probability which results from
infinitely repeated tosses of a fair coin, P(s) is just a classical probability. In the case where $\rho$
is a pure state, the inequality (2) states that a purely quantum mechanical probability is bounded
from above by a purely classical probability up to a multiplicative constant when one performs a
computable POVM measurement with countable outcomes upon a finite dimensional quantum
system in the pure state $\rho$.

The inequalities (1) and (2) are obtained through a generalization of the so-called universal
probability to a matrix-valued function. The Kolmogorov complexity K(s) of a finite binary
string s is originally defined using the concept of program-size. However, there is another
way to define K(s) without referring to such a concept, that is, we first introduce a universal
probability m, and then define K(s) as $-\log_2 m(s)$. The universal probability is a function
from the set of finite binary strings to the open interval (0, 1). In this paper we generalize the
universal probability to a matrix-valued function while keeping the set of finite binary strings
as the domain of definition. Then this generalized universal probability is identified with an analogue
of a POVM, and is called a universal semi-POVM. The inequalities (1) and (2) naturally follow
from this identification.



1.2 Related works 

Our aim is to generalize algorithmic information theory in order to understand the algorithmic
feature of quantum mechanics. There are related works whose purpose is mainly to define the
information content of an individual pure quantum state, i.e., to define the quantum Kolmogorov
complexity of the quantum state [13, 1, 8], while we will not make such an attempt in this paper.

As we mentioned above, K(s) can be defined as the $-\log_2$ of the universal probability without
using the concept of program-size. [8] took this approach in order to define the information
content of a pure quantum state. [8] first generalized the universal probability to a
matrix-valued function $\mu$ called quantum universal semi-density matrix. The $\mu$ is a function which
maps any positive integer N to an N x N positive semi-definite Hermitian matrix $\mu(N)$ with its
trace less than or equal to one. [8] proposed to regard $\mu(N)$ as an analogue of a density matrix
of a quantum system called semi-density matrix. Then, in order to measure the information
content of a pure quantum state $|\psi\rangle \in \mathbb{C}^N$, [8] introduced the quantum algorithmic entropies
$\underline{H}(|\psi\rangle)$ and $\overline{H}(|\psi\rangle)$ as $-\log_2 \langle\psi|\mu(N)|\psi\rangle$ and $-\langle\psi|(\log_2 \mu(N))|\psi\rangle$, respectively. In general, the
trace of a density matrix has to be equal to one. If the trace of $\mu(N)$ is equal to one, then
the quantity $\langle\psi|\mu(N)|\psi\rangle$ in the definition of $\underline{H}(|\psi\rangle)$ has the meaning of the probability that
the outcome is 'yes' when one performs the projective measurement described by the projector
$|\psi\rangle\langle\psi|$ upon the quantum system in the mixed state $\mu(N)$. However, the trace of $\mu(N)$ is not
equal to one for all but finitely many N because of its universality. (This fact is implicitly
mentioned in [8]. For completeness, we include a proof of this fact in Appendix A, in addition
to the definition of $\mu$.)

In quantum mechanics, what is represented by a matrix is either a quantum state or a
measurement operator. In this paper we generalize the universal probability to a matrix-valued
function in a different way from [8], and identify it with an analogue of a POVM. We do not
insist on defining the information content of a quantum state. Instead, we focus on
applying algorithmic information theory to quantum mechanics in order to shed light on an
algorithmic structure of quantum mechanics. Along this line we obtain the above inequalities (1) and (2).



In each of [13] and [1], the quantum Kolmogorov complexity of a qubit string was defined as a
quantum generalization of the standard definition of classical Kolmogorov complexity: the length
of the shortest input for the universal decoding algorithm U to output a finite binary string.
Both [13] and [1] adopt the universal quantum Turing machine as a universal decoding algorithm
U to output a quantum state in their definition. However, there is a difference between [13] and
[1] with respect to the object which is allowed as an input to U. That is, [13] can only allow a
classical binary string as an input, whereas [1] can allow any qubit string. The works [13], [1],
and [8] are closely related to one another as shown in each of these works. In comparison with
our work, since our work is, in essence, based on a generalization of the universal probability,
the work [8] is more related to our work than the works [13] and [1]. These two works may be
related to our work via the work [8].

1.3 Organization of the paper 

We begin in Section 2 with some basic definitions, and review some results of algorithmic
information theory. In Section 3, we prove the inequalities (1) and (2) via the introduction of a
universal semi-POVM as a generalization of the universal probability. In Section 4, we consider
the optimality of our upper bounds $2^{-K(s)}$ and P(s) for the probability of each measurement
outcome s. Finally, we study some other properties of a universal semi-POVM in Section 5.

2 Preliminaries 
2.1 Notation 

We start with some notation about numbers and matrices which will be used in this paper. 

$\mathbb{N} = \{0, 1, 2, 3, \dots\}$ is the set of natural numbers, and $\mathbb{N}^+$ is the set of positive integers. $\mathbb{Z}$
is the set of integers. $\mathbb{Q}$ is the set of rational numbers, and $\mathbb{Q}^+$ is the set of positive rational
numbers. $\mathbb{R}$ is the set of real numbers, and $\mathbb{C}$ is the set of complex numbers. $\mathbb{C}_{\mathbb{Q}}$ is the set of
the complex numbers in the form of a + ib with $a, b \in \mathbb{Q}$. We define $-\log_2 0$ as $\infty$.

We fix N to be any one positive integer throughout this paper. For each matrix A, $A^T$ is
the transpose of A and $A^\dagger$ is the adjoint of A. For each $K \subset \mathbb{C}$, $M_N(K)$ is the set of the N x N
matrices whose elements are in K, and $K^N$ is the set of column vectors consisting of N complex
numbers in K. For each $x = (x_1, x_2, \dots, x_N)^T \in \mathbb{C}^N$, $\|x\|$ is defined as
$(|x_1|^2 + |x_2|^2 + \dots + |x_N|^2)^{1/2}$. For each $A, B \in M_N(\mathbb{C})$, [A, B] is defined as AB - BA. For each $A \in M_N(\mathbb{C})$, $\|A\|$
is the operator norm of A, and tr A denotes the trace of A. The identity matrix in $M_N(\mathbb{C})$ is
denoted by I. U(N) is the set of N x N unitary matrices. Her(N) is the set of N x N Hermitian
matrices. For each $A, B \in \mathrm{Her}(N)$, we write $A \le B$ if B - A is positive semi-definite, and write
A < B if B - A is positive definite. Note that the relation $\le$ on Her(N) is a partial order. In
this paper we will frequently use the property: $\|A\| \le \varepsilon \iff -\varepsilon I \le A \le \varepsilon I$ for any $\varepsilon > 0$ and
any $A \in \mathrm{Her}(N)$. We say $\rho$ is a density matrix if $0 \le \rho \in \mathrm{Her}(N)$ and $\mathrm{tr}(\rho) = 1$. $\mathrm{Her}_{\mathbb{Q}}(N)$ is the
set of N x N Hermitian matrices whose elements are in $\mathbb{C}_{\mathbb{Q}}$. $\mathrm{diag}(x_1, \dots, x_N)$ is the diagonal
matrix whose (i, i)-element is $x_i$.

Let S be any set, and let $f, g\colon S \to \mathrm{Her}(N)$. Then we write f(x) = g(x) + O(1) if there is
a real number c > 0 such that, for all $x \in S$, $\|f(x) - g(x)\| \le c$. We also write $f(x) \sim g(x)$ if
there is a real number c > 0 such that, for all $x \in S$, $c f(x) \le g(x)$ and $c g(x) \le f(x)$.

$\Sigma^* = \{\lambda, 0, 1, 00, 01, 10, 11, 000, 001, 010, \dots\}$ is the set of finite binary strings where $\lambda$
denotes the empty string, and $\Sigma^*$ is ordered as indicated. We identify any string in $\Sigma^*$ with a
natural number in this order, that is, we consider $\varphi\colon \Sigma^* \to \mathbb{N}$ such that $\varphi(s) = 1s - 1$ where
the concatenation 1s of strings 1 and s is regarded as a dyadic integer, and then we identify s
with $\varphi(s)$. For any $s \in \Sigma^*$, |s| is the length of s. A subset S of $\Sigma^*$ is called a prefix-free set if no
string in S is a prefix of another string in S.
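The identification of strings with natural numbers and the prefix-free condition can be made concrete with a short Python sketch. The function names `phi`, `phi_inverse`, and `is_prefix_free` are ours and serve only as an illustration; strings are represented as Python `str` objects over the characters 0 and 1, with the empty string standing for $\lambda$.

```python
def phi(s: str) -> int:
    """Read the string '1' + s as a binary numeral and subtract 1."""
    if any(ch not in "01" for ch in s):
        raise ValueError("expected a binary string")
    return int("1" + s, 2) - 1

def phi_inverse(n: int) -> str:
    """Return the string identified with the natural number n."""
    if n < 0:
        raise ValueError("expected a natural number")
    return bin(n + 1)[3:]  # drop the prefix '0b' and the leading digit 1

def is_prefix_free(strings) -> bool:
    """Check that no string in the collection is a prefix of another one."""
    items = sorted(set(strings))
    # In lexicographic order every extension of a string follows it
    # contiguously, so comparing neighbours is sufficient.
    return all(not b.startswith(a) for a, b in zip(items, items[1:]))
```

For instance, `phi_inverse` applied to 0, 1, 2, 3, ... enumerates the strings in exactly the order displayed above.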

For each $F\colon \Sigma^* \to M_N(\mathbb{C})$, we say F is computable if there exists a total recursive function
$G\colon \Sigma^* \times \mathbb{N} \to M_N(\mathbb{C}_{\mathbb{Q}})$ such that, for all $s \in \Sigma^*$ and all $k \in \mathbb{N}$, $\|F(s) - G(s, k)\| \le 2^{-k}$.
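To illustrate this notion of computability in the simplest case N = 1, the following Python sketch gives rational approximations G(s, k) of the irrational-valued function $F(s) = \sqrt{2}/2^{|s|}$. The function F and the name `approx_F` are hypothetical examples chosen here; they are not taken from the paper.

```python
from fractions import Fraction
from math import isqrt

def approx_F(s: str, k: int) -> Fraction:
    """Rational G(s, k) with |F(s) - G(s, k)| < 2**-k for F(s) = sqrt(2) / 2**len(s)."""
    j = k + 1
    # isqrt(2 * 4**j) is the floor of 2**j * sqrt(2), so the quotient below
    # approximates sqrt(2) from below with an error smaller than 2**-j.
    root = Fraction(isqrt(2 * 4**j), 2**j)
    return root / 2 ** len(s)
```

Since the approximation is exact rational arithmetic, the required precision $2^{-k}$ is guaranteed for every k and not only up to floating-point accuracy.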
2.2 Algorithmic information theory

In the following we review some definitions and results of algorithmic information theory |], 
M. We assume that the reader is familiar with algorithmic information theory in addition to 
computability theory. 

A computer is a partial recursive function $C\colon \Sigma^* \to \Sigma^*$ whose domain of definition is a
prefix-free set. For each computer C and each $s \in \Sigma^*$, $K_C(s)$ is defined as
$\min\{\, |p| : p \in \Sigma^* \text{ and } C(p) = s \,\}$.
A computer U is said to be optimal if for each computer C there exists a constant sim(C)
with the following property: if C(p) is defined, then there is a p' for which U(p') = C(p) and
$|p'| \le |p| + \mathrm{sim}(C)$. It is then shown that there exists a computer which is optimal. We choose
any one optimal computer U as the standard one for use throughout the rest of this paper, and
we define $K(s) = K_U(s)$, which is referred to as the information content of s, the program-size
complexity of s, or the Kolmogorov complexity of s. For each $s \in \Sigma^*$, P(s) is defined by
$P(s) = \sum_{U(p)=s} 2^{-|p|}$. The class of computers is equal to the class of functions which are
computed by self-delimiting Turing machines. A self-delimiting Turing machine has a program tape
and a work tape. The program tape is infinite to the right, while the work tape is infinite in
both directions. The machine starts with an input string on its program tape and the work tape
blank. When the machine halts, the output string is put on the work tape. (For the details of 
self-delimiting Turing machine, see ||.) A self-delimiting Turing machine is called universal if 
it computes an optimal computer. Let $M_U$ be a universal self-delimiting Turing machine which
computes U. Then P(s) is the probability that $M_U$ halts and outputs s when $M_U$ starts on the
program tape filled with an infinite binary string generated by infinitely repeated tosses of a fair
coin.
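The quantities $K_C(s)$ and the halting-and-output probability can be computed explicitly for a computer with a finite domain. The following Python sketch uses a small hypothetical table `TOY_COMPUTER`; it is of course not an optimal computer, and the analogue of P(s) computed here refers to this toy computer C rather than to U.

```python
from fractions import Fraction

# A toy computer C given by a finite table program -> output.
# Its domain {"0", "10", "110", "111"} is a prefix-free set.
TOY_COMPUTER = {"0": "1", "10": "1", "110": "00", "111": "01"}

def is_prefix_free(programs) -> bool:
    """Check that no program is a prefix of another one."""
    items = sorted(programs)
    return all(not b.startswith(a) for a, b in zip(items, items[1:]))

def complexity(computer, s):
    """K_C(s): length of a shortest program p with C(p) = s, or None if none exists."""
    lengths = [len(p) for p, out in computer.items() if out == s]
    return min(lengths) if lengths else None

def output_probability(computer, s) -> Fraction:
    """Sum of 2**-|p| over all programs p with C(p) = s."""
    return sum((Fraction(1, 2 ** len(p)) for p, out in computer.items() if out == s),
               Fraction(0))
```

Because the domain is prefix-free, the output probabilities sum to at most one, and $2^{-K_C(s)}$ is always one of the terms in the sum defining the probability of s.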

A universal probability is defined through the following two definitions. 

Definition 2.1. For any $r\colon \Sigma^* \to [0, \infty)$, we say that r is a lower-computable semi-measure if
r satisfies the following two conditions:

(i) $\sum_{s \in \Sigma^*} r(s) \le 1$.

(ii) There exists a total recursive function $f\colon \mathbb{N} \times \Sigma^* \to \mathbb{Q}$ such that, for each $s \in \Sigma^*$,
$\lim_{n \to \infty} f(n, s) = r(s)$ and $\forall n \in \mathbb{N}\ f(n, s) \le f(n + 1, s)$.

Definition 2.2. Let m be a lower-computable semi-measure. We say that m is a universal
probability if for any lower-computable semi-measure r, there exists a real number c > 0 such
that, for all $s \in \Sigma^*$, $c\, r(s) \le m(s)$.
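A simple instance of Definition 2.1 may help to fix ideas. In the Python sketch below, the semi-measure $r(s) = 2^{-(2|s|+1)}$ and its approximating function f are hypothetical examples chosen for illustration. This r is even computable, so it is far from being a universal probability; it only illustrates conditions (i) and (ii).

```python
from fractions import Fraction
from itertools import product

def r(s: str) -> Fraction:
    """A semi-measure on binary strings: r(s) = 2**-(2|s| + 1)."""
    return Fraction(1, 2 ** (2 * len(s) + 1))

def f(n: int, s: str) -> Fraction:
    """Rational lower approximation of r(s) that is non-decreasing in n."""
    return r(s) * (1 - Fraction(1, 2 ** n))

def partial_sum(max_len: int) -> Fraction:
    """Sum of r(s) over all binary strings of length at most max_len."""
    return sum((r("".join(bits))
                for length in range(max_len + 1)
                for bits in product("01", repeat=length)),
               Fraction(0))
```

The strings of length n contribute $2^n \cdot 2^{-(2n+1)} = 2^{-(n+1)}$ in total, so the partial sums increase to 1 and condition (i) holds with equality in the limit.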

Then the following theorem holds. 

Theorem 2.3. Both $2^{-K(s)}$ and P(s) are universal probabilities.

By Theorem 2.3, we see that, for any universal probability m,

$K(s) = -\log_2 m(s) + O(1)$. (3)

In particular we have $K(s) = -\log_2 P(s) + O(1)$. No universal probability is computable,
which corresponds to the uncomputability of K(s). Moreover we can show the following, from
which the uncomputability of a universal probability follows.

Theorem 2.4. Let m be a universal probability, and let $f\colon \mathbb{N} \to \mathbb{Q}^+$ and $r\colon \mathbb{N} \to \Sigma^*$. Suppose
that both f and r are total recursive functions, and $m(r(n)) \le f(n)$ for all $n \in \mathbb{N}$. Then
$\inf_{n \in \mathbb{N}} f(n) > 0$.
The information theoretic feature of algorithmic information theory can be developed as
follows. We choose any one computable bijection $\langle s, t \rangle$ from $(s, t) \in \Sigma^* \times \Sigma^*$ to $\Sigma^*$. Let
$s, t \in \Sigma^*$. The joint information content K(s, t) of s and t is defined as $K(s, t) = K(\langle s, t \rangle)$.
We then define the relative information content K(s|t) of s relative to t by the equation

$K(s|t) = K(t, s) - K(t)$.

Finally we define the mutual information content K(s : t) of s and t by the equation

$K(s : t) = K(t) - K(t|s) \equiv K(s) + K(t) - K(s, t)$.

Then, without referring to the concept of program-size, [|J proved the following relations using 
the fact that $2^{-K(s)}$ is a universal probability.

Theorem 2.5.

(i) $K(s, t) = K(t, s) + O(1)$.

(ii) $K(s : t) = K(t : s) + O(1)$.

(iii) $K(s : s) = K(s) + O(1)$.

(iv) $\exists c \in \mathbb{R}\ \forall s, t \in \Sigma^*\ \ c \le K(s|t)$.

(v) $\exists c \in \mathbb{R}\ \forall s, t \in \Sigma^*\ \ c \le K(s : t)$.

(vi) $K(s : t) = K(t : s) + O(1)$.

(vii) $K(s : s) = K(s) + O(1)$.

(viii) $K(s : \lambda) = O(1)$.



Thus algorithmic information theory has the formal properties of classical information theory. 

3 Generalization of universal probability to POVM 

In this section we generalize a universal probability to a matrix-valued function. Based on this
generalization, we prove our main result: Theorem 3.9.

Definition 3.1. We say R is a semi-POVM on $\Sigma^*$ if R is a mapping from $\Sigma^*$ to Her(N) which
satisfies $0 \le R(s)$ for all $s \in \Sigma^*$ and $\sum_{s \in \Sigma^*} R(s) \le I$. We say R is a POVM on $\Sigma^*$ if R is a
semi-POVM on $\Sigma^*$ and $\sum_{s \in \Sigma^*} R(s) = I$.

Let Q be a POVM on $\Sigma^*$. The POVM measurement described by Q is performed upon a
finite dimensional quantum system, and gives one of countably many measurement outcomes, which
are represented by finite binary strings.

Given a semi-POVM R on $\Sigma^*$, it is easy to convert R into a POVM on $\Sigma^*$ by appending
an appropriate positive semi-definite matrix to R. Let $\Omega_R = \sum_{s \in \Sigma^*} R(s)$, and then we define
$Q\colon \Sigma^* \to \mathrm{Her}(N)$ by $Q(\lambda) = I - \Omega_R$ and $Q(s') = R(s)$ for each $s \in \Sigma^*$, where s' is the successor
of s. Then Q is a POVM on $\Sigma^*$. Thus a semi-POVM on $\Sigma^*$ has a physical meaning in the same
way as a POVM on $\Sigma^*$.
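The conversion of a semi-POVM into a POVM can be sketched in Python for a semi-POVM that is nonzero on only finitely many strings. The finite support, the restriction to rational real entries, and the names `successor` and `complete_to_povm` are assumptions made here so that the sum can be evaluated exactly; the caller is assumed to supply positive semi-definite matrices whose sum is bounded by the identity.

```python
from fractions import Fraction

def successor(s: str) -> str:
    """Next string after s in the order lambda, 0, 1, 00, 01, ..."""
    return bin(int("1" + s, 2) + 1)[3:]

def complete_to_povm(semi_povm, dim):
    """Return Q with Q(lambda) = I - sum_s R(s) and Q(successor(s)) = R(s)."""
    total = [[sum((m[i][j] for m in semi_povm.values()), Fraction(0))
              for j in range(dim)] for i in range(dim)]
    # The empty string "" plays the role of lambda.
    povm = {"": [[Fraction(int(i == j)) - total[i][j] for j in range(dim)]
                 for i in range(dim)]}
    for s, m in semi_povm.items():
        povm[successor(s)] = m
    return povm

# A hypothetical semi-POVM on a two-dimensional system, supported on two strings.
R = {
    "": [[Fraction(1, 4), Fraction(0)], [Fraction(0), Fraction(1, 8)]],
    "0": [[Fraction(1, 4), Fraction(1, 8)], [Fraction(1, 8), Fraction(1, 4)]],
}
Q = complete_to_povm(R, 2)
```

Shifting every label to its successor frees the label $\lambda$ for the additional element, so that no outcome of the original semi-POVM is lost.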

Definition 3.2. We say R is a lower-computable semi-POVM if R is a semi-POVM on $\Sigma^*$
and there exists a total recursive function $f\colon \mathbb{N} \times \Sigma^* \to \mathrm{Her}_{\mathbb{Q}}(N)$ such that, for each $s \in \Sigma^*$,
$\lim_{n \to \infty} f(n, s) = R(s)$ and $\forall n \in \mathbb{N}\ f(n, s) \le R(s)$.
In the case where N = 1, Definition 3.2 reduces exactly to the definition of a lower-computable
semi-measure. For convenience, we do not require in the above definition that the f(n, s)
converging to R(s) be non-decreasing (i.e., $f(n, s) \le f(n + 1, s)$). However, we can equivalently
assume that f(n, s) is non-decreasing in the definition. See Appendix B for the proof.

The following is a key theorem for our main result. 

Theorem 3.3. If R is a lower-computable semi-POVM, then the mapping $\Sigma^* \ni s \mapsto \frac{1}{N}\|R(s)\|$
is a lower-computable semi-measure.

Proof. Let $r\colon \Sigma^* \to [0, \infty)$ with $r(s) = \frac{1}{N}\|R(s)\|$. Note that $\|A\| \le \mathrm{tr}\, A$ for any positive
semi-definite A. Thus, since $0 \le R(s)$ for all $s \in \Sigma^*$ and $\sum_{s \in \Sigma^*} R(s) \le I$, we see that
$\sum_{s \in \Sigma^*} r(s) \le \frac{1}{N} \mathrm{tr} \sum_{s \in \Sigma^*} R(s) \le \frac{1}{N} \mathrm{tr}\, I = 1$. Thus the condition (i) in Definition 2.1 holds for r.

Next we show that the condition (ii) in Definition 2.1 holds for r. Since R is a
lower-computable semi-POVM, there exists a total recursive function $f\colon \mathbb{N} \times \Sigma^* \to \mathrm{Her}_{\mathbb{Q}}(N)$ such
that for each $s \in \Sigma^*$, $\lim_{n \to \infty} f(n, s) = R(s)$ and $\forall n \in \mathbb{N}\ f(n, s) \le R(s)$. From the definition
of the operator norm, $\|f(n, s)\|$ is the supremum of $\langle\psi|f(n, s)|\psi\rangle / \langle\psi|\psi\rangle$ such that $|\psi\rangle \ne 0$ and
$|\psi\rangle \in \mathbb{C}^N$. Since $\mathbb{Q}$ is dense in $\mathbb{R}$, it is easy to see that $\|f(n, s)\|$ is equal to the supremum of
$\langle\psi|f(n, s)|\psi\rangle / \langle\psi|\psi\rangle$ such that $|\psi\rangle \ne 0$ and each component of $|\psi\rangle$ is a complex number in the
form of a + ib with $a, b \in \mathbb{Z}$. Thus, given $n \in \mathbb{N}$ and $s \in \Sigma^*$, one can generate a sequence of
rational numbers $p_1, p_2, \dots$ such that $p_1 \le p_2 \le \dots \le \|f(n, s)\|$ and $\lim_{m \to \infty} p_m = \|f(n, s)\|$.
On the other hand, using the property $A \le B \Rightarrow \|A\| \le \|B\|$, we have $\|f(n, s)\| \le \|R(s)\|$
and $\lim_{n \to \infty} \|f(n, s)\| = \|R(s)\|$. Hence, given $s \in \Sigma^*$, one can generate a sequence of rational
numbers $x_1, x_2, \dots$ such that $x_1 \le x_2 \le \dots \le \|R(s)\|$ and $\lim_{n \to \infty} x_n = \|R(s)\|$. Therefore the
condition (ii) in Definition 2.1 holds for r. Hence r is a lower-computable semi-measure. □
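Condition (i) for the mapping $s \mapsto \frac{1}{N}\|R(s)\|$ can be checked numerically on a small example. The Python sketch below is restricted to N = 2, where the operator norm of a positive semi-definite Hermitian matrix has a closed form as its largest eigenvalue; the example semi-POVM `R` and the function names are hypothetical and serve only as an illustration.

```python
import math

def operator_norm(a):
    """Largest eigenvalue of a 2x2 positive semi-definite Hermitian matrix."""
    mean = (a[0][0].real + a[1][1].real) / 2
    half_gap = (a[0][0].real - a[1][1].real) / 2
    return mean + math.sqrt(half_gap ** 2 + abs(a[0][1]) ** 2)

def induced_semi_measure(semi_povm, dim=2):
    """r(s) = ||R(s)|| / N for every label s of a finitely supported semi-POVM."""
    return {s: operator_norm(m) / dim for s, m in semi_povm.items()}

# A hypothetical semi-POVM on a two-dimensional system, supported on two strings.
R = {"": [[0.25, 0.0], [0.0, 0.125]], "0": [[0.25, 0.125], [0.125, 0.25]]}
r = induced_semi_measure(R)
```

The factor 1/N compensates for the gap between the operator norm and the trace, which is what makes the values r(s) sum to at most one.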



Definition 3.4. Let M be a lower-computable semi-POVM. We say that M is a universal
semi-POVM if for each lower-computable semi-POVM R, there exists a real number c > 0 such that
for all $s \in \Sigma^*$, $c R(s) \le M(s)$.



In the case where N = 1, Definition 3.4 reduces exactly to the definition of a universal
probability. The use of the partial order $\le$ for the purpose of generalizing lower-computable
semi-measure and universal probability to matrix-valued functions is suggested in [8]. Note that
if M is a universal semi-POVM then, for all $s \in \Sigma^*$, M(s) is positive definite.

A universal semi-POVM may have a simple form as the following theorem says. 

Theorem 3.5. If m is a universal probability, then the mapping $\Sigma^* \ni s \mapsto m(s)I$ is a universal
semi-POVM.

Proof. Let $M\colon \Sigma^* \to \mathrm{Her}(N)$ with M(s) = m(s)I. Since m is a lower-computable semi-measure,
it is obvious that M is a lower-computable semi-POVM. Suppose that R is a lower-computable
semi-POVM. By Theorem 3.3, the mapping $\Sigma^* \ni s \mapsto \frac{1}{N}\|R(s)\|$ is a lower-computable
semi-measure. Thus, since m is a universal probability, there is c > 0 such that, for all $s \in \Sigma^*$,
$\frac{c}{N}\|R(s)\| \le m(s)$. Since $R(s) \le \|R(s)\| I$, we therefore have $\frac{c}{N} R(s) \le m(s)I$ for all $s \in \Sigma^*$. Hence M is a universal
semi-POVM. □

For this universal semi-POVM m(s)I, we have [m(s)I, m(t)I] = 0 for all s and $t \in \Sigma^*$.
However, the following theorem guarantees the existence of a 'non-trivial' universal semi-POVM.

Theorem 3.6. There exists a universal semi-POVM M such that $[M(s), M(t)] \ne 0$ for any
distinct s and $t \in \Sigma^*$.
Proof. We choose any one universal probability m, and choose any one pair of G and
$H \in \mathrm{Her}_{\mathbb{Q}}(N)$ such that 0 < G, H < I and $[G, H] \ne 0$. We define $M\colon \Sigma^* \to \mathrm{Her}(N)$ by

$M(s) = \frac{1}{2} m(s) \left( \frac{1}{s+1} G + H \right)$.

Then we see that, for any distinct s and $t \in \Sigma^*$,

$[M(s), M(t)] = \frac{1}{4} m(s) m(t) \left( \frac{1}{s+1} - \frac{1}{t+1} \right) [G, H] \ne 0$.

Since m is a lower-computable semi-measure, M is shown to be a lower-computable semi-POVM.
It follows from 0 < H that there is c > 0 such that $cI \le H$. Thus $\frac{c}{2} m(s) I \le M(s)$. Since m(s)I
is a universal semi-POVM, M is also a universal semi-POVM. □

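The following theorem is a more general form of our main result.

Theorem 3.7. Let m be a universal probability, and let R be a lower-computable semi-POVM.
Then the following (i) and (ii) hold:

(i) There exists c > 0 such that, for any normalized $|\psi\rangle \in \mathbb{C}^N$ and any $s \in \Sigma^*$,

$\langle\psi|R(s)|\psi\rangle \le c\, m(s)$.

(ii) There exists c > 0 such that, for any density matrix $\rho \in \mathrm{Her}(N)$ and any $s \in \Sigma^*$,

$\mathrm{tr}(\rho R(s)) \le c\, m(s)$.

Proof. It follows from Theorem 3.5 that (i) holds. For (ii), write the spectral decomposition of
$\rho$ as $\rho = \sum_i p_i |\psi_i\rangle\langle\psi_i|$ with $p_i \ge 0$ and $\sum_i p_i = 1$. Then, by (i),
$\mathrm{tr}(\rho R(s)) = \sum_i p_i \langle\psi_i|R(s)|\psi_i\rangle \le c\, m(s)$. □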



To make the physical implication of Theorem 3.7 clearer, we restrict our attention
to a POVM on $\Sigma^*$ which is computable. Informally, a POVM on $\Sigma^*$ is computable if and
only if one can compute all its POVM elements to any desired degree of precision. Thus the
computability of a POVM is thought to be inherent in the case where one wants to perform a
well-controlled quantum measurement described by the POVM. Using the following lemma, we
have our main result about a computable POVM.

Lemma 3.8. Let R be a semi-POVM on $\Sigma^*$. If R is computable then R is a lower-computable
semi-POVM.

Proof. Since R is computable, there exists a total recursive function $G\colon \Sigma^* \times \mathbb{N} \to M_N(\mathbb{C}_{\mathbb{Q}})$ such
that, for all $s \in \Sigma^*$ and all $k \in \mathbb{N}$, $\|R(s) - G(s, k)\| \le 2^{-k}$. We define $H\colon \Sigma^* \times \mathbb{N} \to M_N(\mathbb{C}_{\mathbb{Q}})$ by
$H(s, k) = \frac{1}{2}\{G(s, k) + G(s, k)^\dagger\}$. Then H is a total recursive function and, for every $s \in \Sigma^*$ and
every $k \in \mathbb{N}$, $H(s, k) \in \mathrm{Her}_{\mathbb{Q}}(N)$ and $\|R(s) - H(s, k)\| \le 2^{-k}$. Thus we have $H(s, k) - 2^{-k} I \le R(s)$
and $\lim_{k \to \infty} (H(s, k) - 2^{-k} I) = R(s)$. Hence the result follows. □
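The two steps of this proof, taking the Hermitian part of an approximation and then shifting it down by $2^{-k} I$, can be sketched in Python for N = 2. The matrices `R` and `G` below are hypothetical, floating-point numbers stand in for the rational entries of the proof, and the positivity test uses the fact that a 2x2 Hermitian matrix is positive semi-definite exactly when its trace and determinant are non-negative.

```python
def adjoint(a):
    """Conjugate transpose of a square matrix stored as nested lists."""
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]

def hermitian_part(g):
    """H = (G + G^dagger) / 2."""
    gd = adjoint(g)
    n = len(g)
    return [[(g[i][j] + gd[i][j]) / 2 for j in range(n)] for i in range(n)]

def lower_approximation(g, k):
    """H - 2**-k I, which lies below R whenever ||R - G|| <= 2**-k."""
    h = hermitian_part(g)
    n = len(h)
    return [[h[i][j] - (2.0 ** -k if i == j else 0.0) for j in range(n)]
            for i in range(n)]

def is_psd_2x2(a, tol=1e-12):
    """Positive semi-definiteness test for a 2x2 Hermitian matrix."""
    tr = (a[0][0] + a[1][1]).real
    det = (a[0][0] * a[1][1] - a[0][1] * a[1][0]).real
    return tr >= -tol and det >= -tol

R = [[0.5, 0.25j], [-0.25j, 0.25]]
k = 4
# A non-Hermitian approximation of R whose error has operator norm 2**-(k + 1).
G = [[0.5, 0.25j + 2.0 ** -(k + 1)], [-0.25j, 0.25]]
L = lower_approximation(G, k)
gap = [[R[i][j] - L[i][j] for j in range(2)] for i in range(2)]
```

Taking the Hermitian part does not increase the distance to the Hermitian matrix R, which is why the same precision $2^{-k}$ can be used for H as for G.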

Theorem 3.9 (Main result). Let R be a computable POVM on $\Sigma^*$. Then the following hold:

(i) There exists $d \in \mathbb{N}$ such that, for any density matrix $\rho \in \mathrm{Her}(N)$ and any $s \in \Sigma^*$,

$K(s) - d \le -\log_2 \mathrm{tr}(\rho R(s))$. (4)

(ii) There exists c > 0 such that, for any density matrix $\rho \in \mathrm{Her}(N)$ and any $s \in \Sigma^*$,

$\mathrm{tr}(\rho R(s)) \le c P(s)$. (5)

Proof. Theorem 3.9 immediately follows from Theorem 2.3, (ii) in Theorem 3.7, and Lemma
3.8. □
4 Optimality of universal semi-POVM



In this section we consider the optimality of a universal semi-POVM. By Theorem 3.5 we have
the following theorem.

Theorem 4.1. Let M be a universal semi-POVM, and let m be a universal probability. Then
$M(s) \sim m(s)I$.



The following theorem immediately follows from Theorem 4.1. This theorem is the most
general form which represents the optimality of a universal semi-POVM from the viewpoint of
the probability of each measurement outcome.

Theorem 4.2. Let m be a universal probability, and let M be a universal semi-POVM. Then
there exist $c_1 > 0$ and $c_2 > 0$ such that, for any density matrix $\rho \in \mathrm{Her}(N)$ and any $s \in \Sigma^*$,

$c_1 m(s) \le \mathrm{tr}(\rho M(s)) \le c_2 m(s)$.



By Theorem 2.3 and Theorem 4.2, we have Theorem 4.3.

Theorem 4.3. Let M be a universal semi-POVM. Then, for any density matrix $\rho \in \mathrm{Her}(N)$
and any $s \in \Sigma^*$,

$K(s) = -\log_2 \mathrm{tr}(\rho M(s)) + O(1)$,
$P(s) \sim \mathrm{tr}(\rho M(s))$.

Thus, if we can perform the POVM measurement described by a universal semi-POVM, then
we can achieve the upper bound P(s) (or $2^{-K(s)}$) in Theorem 3.9 up to a multiplicative constant.
However, no universal semi-POVM is computable (see Subsection 5.2). Moreover we can
show that there is no computable semi-POVM on $\Sigma^*$ which can achieve the upper bound P(s)
(or $2^{-K(s)}$) up to a multiplicative constant. Instead, by the definition of universal semi-POVM,
we have the following theorem, which states that we can approximate any universal semi-POVM
by a recursive sequence of semi-POVMs on $\Sigma^*$ from below.

Theorem 4.4. For any universal semi-POVM M, there exists a sequence $F_0, F_1, F_2, \dots$ of
semi-POVMs on $\Sigma^*$ such that

(i) $F_n(s) \in \mathrm{Her}_{\mathbb{Q}}(N)$ and $0 < F_n(s) \le F_{n+1}(s) \le M(s)$ for all $(n, s) \in \mathbb{N} \times \Sigma^*$,

(ii) the sequence $F_0, F_1, F_2, \dots$ of functions uniformly converges to M, and

(iii) the mapping $\mathbb{N} \times \Sigma^* \ni (n, s) \mapsto F_n(s)$ is a total recursive function.

Proof. Since M is a universal semi-POVM, by Theorem B.1 in Appendix B, there exists a total
recursive function $g\colon \mathbb{N} \times \Sigma^* \to \mathrm{Her}_{\mathbb{Q}}(N)$ such that, for each $s \in \Sigma^*$, $\lim_{n \to \infty} g(n, s) = M(s)$
and $\forall n \in \mathbb{N}\ g(n, s) \le g(n + 1, s) \le M(s)$. Note that 0 < M(s) for any $s \in \Sigma^*$. Thus, there
exists a total recursive function $r\colon \mathbb{N} \times \Sigma^* \to \mathbb{N}$ such that, for each s and n, r(n, s) < r(n + 1, s)
and 0 < g(r(n, s), s). We define the sequence $F_0, F_1, F_2, \dots$ of semi-POVMs on $\Sigma^*$ by
$F_n(s) = g(r(n, s), s)$. It is then obvious that (i) and (iii) in Theorem 4.4 hold for this sequence. For any
$\varepsilon > 0$, there is $s_0 \in \Sigma^*$ such that $\sum_{s > s_0} M(s) < \varepsilon I$, so we see that $\|F_n(s) - M(s)\| < \varepsilon$ for all
$n \in \mathbb{N}$ and all $s > s_0$. On the other hand, it is easy to see that there is $n_0 \in \mathbb{N}$ such that, for
all $n \ge n_0$ and all $s \le s_0$, $\|F_n(s) - M(s)\| < \varepsilon$. Thus (ii) in Theorem 4.4 holds for the sequence
$F_0, F_1, F_2, \dots$ of functions. □
For the recursive sequence $F_0, F_1, F_2, \dots$ of semi-POVMs on $\Sigma^*$ given in Theorem 4.4, $F_n$
is a computable semi-POVM on $\Sigma^*$ for each $n \in \mathbb{N}$. However, since no universal semi-POVM
is a POVM on $\Sigma^*$ (see Subsection 5.2) and $F_n(s) \le M(s)$, $F_n$ is not a POVM on $\Sigma^*$ for
each n. Instead, we can also consider the recursive sequence $G_1, G_2, G_3, \dots$ of POVMs defined
as follows: Each POVM element of $G_n$ is labeled by a finite binary string less than or equal to
$\varphi(n)$. For any $s < \varphi(n)$, $G_n(s)$ is defined as $F_n(s)$, and $G_n(\varphi(n))$ is defined as $I - \sum_{s < \varphi(n)} F_n(s)$.
Then, since $F_n(s) \in \mathrm{Her}_{\mathbb{Q}}(N)$, given any $n \in \mathbb{N}^+$, one can calculate all POVM elements of $G_n$.
Note that the POVM measurement described by $G_n$ gives one of n + 1 measurement outcomes,
each of which is represented by a finite binary string less than or equal to $\varphi(n)$. By Theorem
4.4, we have the following:

• Given any $\varepsilon > 0$, for all sufficiently large $n \in \mathbb{N}^+$, if $s < \varphi(n)$ and $\rho$ is a density matrix
then $0 \le \mathrm{tr}(\rho M(s)) - \mathrm{tr}(\rho G_n(s)) < \varepsilon$.

Thus, in the sense that the above statement holds, the recursive sequence $G_1, G_2, G_3, \dots, G_n, \dots$
of POVMs converges to the universal semi-POVM M from below as $n \to \infty$.



5 Other properties of universal semi-POVM 

In this section we study the properties of universal semi-POVM further. 

5.1 Matrix-valued algorithmic information theory

Let M be any one universal semi-POVM. The equation (3) suggests defining a matrix-valued
Kolmogorov complexity $\mathcal{K}(s)$ of $s \in \Sigma^*$ by

$\mathcal{K}(s) = -\log_2 M(s)$. (6)

For this definition of $\mathcal{K}$, it follows from Theorem 4.1 that

$\mathcal{K}(s) = K(s) I + O(1)$. (7)

Further we can define $\mathcal{K}(s, t)$, $\mathcal{K}(s|t)$, and $\mathcal{K}(s : t)$ in the same manner as the definitions of
K(s, t), K(s|t), and K(s : t), respectively. Then using (7) we see that all the relations in
Theorem 2.5 hold for these $\mathcal{K}$'s in place of the K's.

Note that K(s) is originally defined using the concept of program-size. Since $\mathcal{K}(s)$ is related
to K(s) through the equation (7), $\mathcal{K}(s)$ has the meaning of program-size in some weak sense. It
would be interesting to find a more concrete definition of $\mathcal{K}(s)$ using something like the concept
of program-size instead of the equation (6). However, this is still open.

In order to measure the information content of a quantum state $|\psi\rangle \in \mathbb{C}^N$, [8]
introduced the quantum algorithmic entropies $\underline{H}(|\psi\rangle)$ and $\overline{H}(|\psi\rangle)$ of $|\psi\rangle$ as $-\log_2 \langle\psi|\mu(N)|\psi\rangle$ and
$-\langle\psi|(\log_2 \mu(N))|\psi\rangle$, respectively, using his quantum universal semi-density matrix $\mu$ (see
Appendix A for its definition). In this regard note that, for our universal semi-POVM M, the
following holds for any normalized $|\psi\rangle \in \mathbb{C}^N$:

$K(s) = -\log_2 \langle\psi|M(s)|\psi\rangle + O(1) = -\langle\psi|(\log_2 M(s))|\psi\rangle + O(1)$.

Thus $-\log_2 \langle\psi|M(s)|\psi\rangle$ and $-\langle\psi|(\log_2 M(s))|\psi\rangle$ are independent of $|\psi\rangle$ up to an additive
constant. So it would seem difficult to measure the information content of a quantum state $|\psi\rangle$
using these quantities in a manner similar to [8], although such an attempt is not the purpose
of this paper.
5.2 Relation to universal probability



We say $x \in \mathbb{C}^N$ is computable if each component of $x$ has the form $a + ib$, where $a$ and $b$ are computable real numbers. Theorem 5.1 describes a property of a universal semi-POVM as a universal probability. 

Theorem 5.1. Let $M$ be a universal semi-POVM, and let $x \in \mathbb{C}^N$ be computable with $\|x\| = 1$. Then the mapping $\Sigma^* \ni s \mapsto x^\dagger M(s) x$ is a universal probability. 

Proof. Since $x$ is computable, $x^\dagger M(s) x$ can be shown to be a lower-computable semi-measure. Let $m$ be a universal probability. Then, by Theorem 4.1, we see that $x^\dagger M(s) x \simeq m(s)$. Thus the result follows. □ 
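
The semi-measure property used in the proof can be spelled out in one line. This sketch assumes that a semi-POVM satisfies $0 \leqslant M(s)$ for every $s$ and $\sum_{s} M(s) \leqslant I$; that condition is our reading of the definition given earlier in the paper, not a quotation of it.

```latex
% Non-negativity of each term and normalization, using x^\dagger x = \|x\|^2 = 1:
x^\dagger M(s)\, x \ge 0,
\qquad
\sum_{s \in \Sigma^*} x^\dagger M(s)\, x
  = x^\dagger \Bigl( \sum_{s \in \Sigma^*} M(s) \Bigr) x
  \le x^\dagger x = 1 .
```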



Let $M$ be a universal semi-POVM. Then, by Theorem 5.1, each diagonal element $M_{ii}(s)$ of $M(s)$ is a universal probability as a function of $s$, and $\frac{1}{N}\mathrm{tr}(M(s))$ is also a universal probability as a function of $s$. Since no universal probability is computable, no diagonal element $M_{ii}(s)$ is computable. Hence no universal semi-POVM is computable. If $M$ is a POVM on $\Sigma^*$, then, since $M$ is a lower-computable semi-POVM, we can show that $M$ is computable. Thus no universal semi-POVM is a POVM on $\Sigma^*$. 

5.3 Computable unitary invariance 

We say $A \in M_N(\mathbb{C})$ is computable if each element of $A$ has the form $a + ib$, where $a$ and $b$ are computable real numbers. The following theorem states an invariance of a POVM measurement described by a universal semi-POVM under a computable unitary transformation of the quantum state being measured. 

Theorem 5.2. Let $M$ be a universal semi-POVM, and let $U \in U(N)$ be computable. Then the mapping $\Sigma^* \ni s \mapsto U^\dagger M(s) U$ is a universal semi-POVM. 

Proof. We note the property that $A \leqslant B \Rightarrow X^\dagger A X \leqslant X^\dagger B X$ for any $A, B \in \mathrm{Her}(N)$ and any $X \in M_N(\mathbb{C})$. Since $U$ is computable, $U^\dagger M(s) U$ can be shown to be a lower-computable semi-POVM. Let $m$ be a universal probability. Then, by Theorem 4.1, we see that $U^\dagger M(s) U \simeq m(s) I \simeq M(s)$. Thus the result follows. □ 
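
The order property quoted at the start of the proof, and the way it yields the two-sided bound, can be sketched as follows. The constants $0 < c \le c'$ are notation introduced here, under the assumption that Theorem 4.1 gives $c\,m(s) I \leqslant M(s) \leqslant c'\,m(s) I$ for all $s$.

```latex
% Order is preserved under conjugation: for every v in C^N, put w = Xv.
v^\dagger \bigl( X^\dagger B X - X^\dagger A X \bigr) v
  = w^\dagger (B - A)\, w \ge 0
\quad\Longrightarrow\quad
X^\dagger A X \leqslant X^\dagger B X .

% Applied with X = U and U^\dagger U = I:
c\, m(s)\, I \leqslant M(s) \leqslant c'\, m(s)\, I
\quad\Longrightarrow\quad
c\, m(s)\, I \leqslant U^\dagger M(s)\, U \leqslant c'\, m(s)\, I .
```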

Let $U \in U(N)$ be computable, and let $\mathcal{M}$ be a POVM measurement described by a universal semi-POVM. Suppose that, given any state $\rho$, we first evolve $\rho$ by the unitary transformation $U$, and then perform the measurement $\mathcal{M}$ on the transformed state (i.e., $U \rho U^\dagger$). Then, by Theorem 5.2, the whole POVM measurement for $\rho$ is shown to be still described by a universal semi-POVM. 
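
The step behind this remark is the cyclicity of the trace: $\mathrm{tr}(M(s)\, U \rho U^\dagger) = \mathrm{tr}(U^\dagger M(s) U\, \rho)$. The following minimal Python sketch checks this identity exactly on hypothetical 2x2 data; the matrices `U`, `rho`, and `E` are illustrative choices, not taken from the paper, and they are real, so the adjoint reduces to the transpose.

```python
from fractions import Fraction as F

def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(a):
    """Transpose; equals the adjoint for the real matrices used here."""
    return [list(row) for row in zip(*a)]

def trace(a):
    """Sum of the diagonal entries."""
    return sum(a[i][i] for i in range(len(a)))

# Hypothetical data: a rational orthogonal (hence unitary) U, a pure state rho,
# and one positive semi-definite element E standing in for M(s).
U = [[F(3, 5), F(4, 5)], [F(4, 5), F(-3, 5)]]
rho = [[F(1), F(0)], [F(0), F(0)]]
E = [[F(1, 2), F(1, 4)], [F(1, 4), F(1, 4)]]

Ut = transpose(U)

# Evolve the state first, then measure: tr(E U rho U^dagger).
p_evolve_then_measure = trace(matmul(E, matmul(U, matmul(rho, Ut))))

# Measure the original state with the conjugated element: tr(U^dagger E U rho).
p_conjugated_element = trace(matmul(matmul(Ut, matmul(E, U)), rho))

print(p_evolve_then_measure, p_conjugated_element)
```

Both probabilities agree, which is why the combined procedure is described by the mapping $s \mapsto U^\dagger M(s) U$ of Theorem 5.2.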



Acknowledgments 

The author is grateful to Hiroshi Imai and Keiji Matsumoto for their support. 



References 

[1] Bernstein E. and Vazirani U., Quantum complexity theory, SIAM J. Comput., 26 (1997), 
pp. 1411-1473. 

[2] Berthiaume A., van Dam W., and Laplante S., Quantum Kolmogorov complexity, J. Comput. 
System Sci., 63 (2001), pp. 201-221. 






[3] Bhatia R., Matrix Analysis, Springer, New York, 1996. 

[4] Calude C. S., Hertling P. H., Khoussainov B., and Wang Y., Recursively enumerable reals 
and Chaitin Ω numbers, Theoret. Comput. Sci., 255 (2001), pp. 125-149. 

[5] Calude C. S., Information and Randomness: An Algorithmic Perspective, 2nd Edition, 
Revised and Extended, Springer, Berlin, 2002. 

[6] Chaitin G. J., A theory of program size formally identical to information theory, J. Assoc. 
Comput. Mach., 22 (1975), pp. 329-340. 

[7] Chaitin G. J., Incompleteness theorems for random reals, Adv. in Appl. Math., 8 (1987), 
pp. 119-146. 

[8] Gács P., Quantum algorithmic entropy, J. Phys. A: Math. Gen., 34 (2001), pp. 6859-6880. 

[9] Horn R. A. and Johnson C. R., Matrix Analysis, Cambridge University Press, Cambridge, 
1985. 

[10] Kučera A. and Slaman T. A., Randomness and recursive enumerability, SIAM J. Comput., 
31 (2001), pp. 199-211. 

[11] Nielsen M. A. and Chuang I. L., Quantum Computation and Quantum Information, 
Cambridge University Press, Cambridge, 2000. 

[12] Preskill J., Quantum Computation, 2000. Course notes available at URL: 
http://www.theory.caltech.edu/people/preskill/ph229/. 

[13] Vitányi P. M. B., Quantum Kolmogorov complexity based on classical descriptions, IEEE 
Trans. Inform. Theory, 47 (2001), pp. 2464-2479. 

[14] Li M. and Vitányi P. M. B., An Introduction to Kolmogorov Complexity and Its 
Applications, Second Edition, Springer, New York, 1997. 

A Quantum universal semi-density matrix 

We reproduce the definition of quantum universal semi-density matrix from [8] as follows. 

Definition A.1. Let $\sigma\colon \mathbb{N}^+ \to \bigcup_{N \ge 1} \mathrm{Her}(N)$. We say $\sigma$ is a lower semicomputable semi-density matrix if $\sigma$ satisfies the following conditions: 

(i) For each $N \in \mathbb{N}^+$, $0 \leqslant \sigma(N) \in \mathrm{Her}(N)$ and $\mathrm{tr}(\sigma(N)) \le 1$. 

(ii) There exists a total recursive function $f\colon \mathbb{N}^+ \times \mathbb{N} \to \bigcup_{N \ge 1} \mathrm{Her}_{\mathbb{Q}}(N)$ such that, for each $N \in \mathbb{N}^+$, $\lim_{k \to \infty} f(N,k) = \sigma(N)$ and $\forall k \in \mathbb{N}$ $f(N,k) \in \mathrm{Her}_{\mathbb{Q}}(N)$ & $f(N,k) \leqslant f(N,k+1)$. 

Definition A.2. Let $\mu$ be a lower semicomputable semi-density matrix. We say $\mu$ is a quantum universal semi-density matrix if for any lower semicomputable semi-density matrix $\sigma$, there exists a real number $c > 0$ such that, for all $N \in \mathbb{N}^+$, $c\,\sigma(N) \leqslant \mu(N)$. 

Theorem A.3. If $\mu$ is a quantum universal semi-density matrix, then $\mathrm{tr}(\mu(N)) < 1$ for all but finitely many $N \in \mathbb{N}^+$. 

Proof. Since $\mu$ is a lower semicomputable semi-density matrix, there exists a total recursive function $f$ on $\mathbb{N}^+ \times \mathbb{N}$ such that, for each $N \in \mathbb{N}^+$, $\lim_{k \to \infty} f(N,k) = \mu(N)$ and $\forall k \in \mathbb{N}$ $f(N,k) \in \mathrm{Her}_{\mathbb{Q}}(N)$ & $f(N,k) \leqslant \mu(N)$. Let $\mu_{ii}(N)$ be the $(i,i)$-element of $\mu(N)$, and let $f_{ii}(N,k)$ be the $(i,i)$-element of $f(N,k)$. Then, since $\mathrm{tr}(\mu(N)) \le 1$, we see that $\mu_{ii}(N) \le 1 - \sum_{j \ne i} f_{jj}(N,k)$. In particular, for any $N$ with $\mathrm{tr}(\mu(N)) = 1$, we have $\mu_{ii}(N) = \lim_{k \to \infty} \bigl(1 - \sum_{j \ne i} f_{jj}(N,k)\bigr)$. On the other hand, it follows from $\sum_{i=1}^{N} \mu_{ii}(N) \le 1$ that $\min\{\mu_{ii}(N) \mid 1 \le i \le N\} \le 1/N$. Therefore, given any $\varepsilon > 0$, for each sufficiently large $N$, there is $i$ such that $1 \le i \le N$ and $\mu_{ii}(N) < \varepsilon$. 

Now, contrary to Theorem A.3, let us assume that, for infinitely many $N \in \mathbb{N}^+$, $\mathrm{tr}(\mu(N)) = 1$. Then, given $\varepsilon \in \mathbb{Q}^+$, by checking for each $(N,i,k)$ in an exhaustive order whether $1 - \sum_{j \ne i} f_{jj}(N,k) < \varepsilon$ holds or not, one can find $(N,i)$ such that $\mu_{ii}(N) < \varepsilon$. Let $m$ be any universal probability, and define the lower semicomputable semi-density matrix $\sigma$ by $\sigma(N) = \mathrm{diag}(m(\varphi^{-1}(1)), \ldots, m(\varphi^{-1}(N)))$. Then, since $\mu$ is a quantum universal semi-density matrix, for this $\sigma$, there is $c_\sigma > 0$ such that if $1 \le i \le N$ then $c_\sigma\, m(\varphi^{-1}(i)) \le \mu_{ii}(N)$. It follows that there exists a total recursive function $r\colon \mathbb{N} \to \Sigma^*$ such that, for any $n \in \mathbb{N}$, $m(r(n)) < 2^{-n}$. This contradicts Theorem 2.4. Thus we have Theorem A.3. □ 



B On the definition of lower-computable semi-POVM 

The following theorem guarantees that one can equivalently assume that $f(n,s)$ converging to $R(s)$ is non-decreasing in Definition 3.2. 

Theorem B.1. $R$ is a lower-computable semi-POVM if and only if $R$ is a semi-POVM on $\Sigma^*$ and there exists a total recursive function $f\colon \mathbb{N} \times \Sigma^* \to \mathrm{Her}_{\mathbb{Q}}(N)$ such that for each $s \in \Sigma^*$, $\lim_{n \to \infty} f(n,s) = R(s)$ and $\forall n \in \mathbb{N}$ $f(n,s) \leqslant f(n+1,s)$. 



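For the proof of Theorem B.1 we need the following lemma, which is an elementary result of linear algebra. 

Lemma B.2. For any $A \in \mathrm{Her}(N)$, $0 \leqslant A$ if and only if all principal minors of $A$ are non-negative. 

By Lemma B.2, given $A$ and $B$ in $\mathrm{Her}_{\mathbb{Q}}(N)$, one can effectively check whether $A \leqslant B$ holds or not: apply the lemma to $B - A$, whose principal minors are rational numbers computable from its entries. 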
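
The effective check can be sketched in a few lines of Python using exact rational arithmetic. As a simplifying assumption, this sketch handles only real symmetric matrices with rational entries (a special case of $\mathrm{Her}_{\mathbb{Q}}(N)$); the function names `det`, `is_psd`, and `leq` are introduced here for illustration.

```python
from fractions import Fraction
from itertools import combinations

def det(m):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [[Fraction(x) for x in row] for row in m]
    n = len(m)
    result = Fraction(1)
    for col in range(n):
        # Find a row with a non-zero entry in this column to use as pivot.
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            result = -result  # a row swap flips the sign
        result *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return result

def is_psd(a):
    """True iff every principal minor of the symmetric matrix a is >= 0."""
    n = len(a)
    for size in range(1, n + 1):
        for idx in combinations(range(n), size):
            sub = [[a[i][j] for j in idx] for i in idx]
            if det(sub) < 0:
                return False
    return True

def leq(a, b):
    """Decide a <= b in the positive semi-definite order by testing b - a."""
    n = len(a)
    diff = [[b[i][j] - a[i][j] for j in range(n)] for i in range(n)]
    return is_psd(diff)
```

Note that all principal minors are tested, not only the leading ones: the matrix `[[0, 0], [0, -1]]` has non-negative leading minors but is not positive semi-definite. The enumeration is exponential in $N$, which is harmless here because only decidability is needed.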



Proof of Theorem B.1. Assume that $R$ is a semi-POVM on $\Sigma^*$ and there exists a total recursive function $f\colon \mathbb{N} \times \Sigma^* \to \mathrm{Her}_{\mathbb{Q}}(N)$ such that for each $s \in \Sigma^*$, $\lim_{n \to \infty} f(n,s) = R(s)$ and $\forall n \in \mathbb{N}$ $f(n,s) \leqslant R(s)$. Let $g\colon \mathbb{N} \times \Sigma^* \to \mathrm{Her}_{\mathbb{Q}}(N)$ be a total recursive function such that $g(n,s) = f(n,s) - 2^{-n} I$. Then, for each $s \in \Sigma^*$, $\lim_{n \to \infty} g(n,s) = R(s)$ and $\forall n \in \mathbb{N}$ $g(n,s) < R(s)$. Thus, for each $s$ and $n$, there is a positive real number $c$ such that $cI \leqslant R(s) - g(n,s)$, and then, for this $c$, there is an $m \in \mathbb{N}$ such that $m > n$ and $R(s) - g(m,s) \leqslant cI$. So we have $g(n,s) \leqslant g(m,s)$. Thus, given $s$ and $n$, by checking $g(n,s) \leqslant g(k,s)$ for each $k > n$ in increasing order, one can eventually find an $m$ with $g(n,s) \leqslant g(m,s)$. Therefore there exists a total recursive function $r\colon \mathbb{N} \times \Sigma^* \to \mathbb{N}$ such that, for each $s$ and $n$, $r(n,s) < r(n+1,s)$ and $g(r(n,s),s) \leqslant g(r(n+1,s),s)$. We define a total recursive function $h\colon \mathbb{N} \times \Sigma^* \to \mathrm{Her}_{\mathbb{Q}}(N)$ by $h(n,s) = g(r(n,s),s)$. Then, for each $s \in \Sigma^*$, $\lim_{n \to \infty} h(n,s) = R(s)$ and $\forall n \in \mathbb{N}$ $h(n,s) \leqslant h(n+1,s)$. Hence, $R$ is a lower-computable semi-POVM. 

The other implication is obvious. Thus the theorem is obtained. □ 
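
The search that produces $r$ and $h$ in the proof can be illustrated in the one-dimensional case $N = 1$ with a fixed $s$, where the matrix order is the usual order on rationals. The sequence `f` below is a hypothetical example (approximations from below to $R = 1$ that are not monotone); it is not taken from the paper, and the search itself never uses the value of $R$.

```python
from fractions import Fraction

def f(n):
    """Hypothetical rational approximations to R = 1 from below, not monotone in n."""
    bump = 2 if n % 2 == 0 else 1
    return 1 - Fraction(bump, n + 1)

def g(n):
    """Shifted sequence g(n) = f(n) - 2^(-n), strictly below the limit R."""
    return f(n) - Fraction(1, 2 ** n)

def r(n):
    """Index sequence: r(0) = 0, and r(n+1) is the least k > r(n) with g(r(n)) <= g(k)."""
    m = 0
    for _ in range(n):
        k = m + 1
        # Terminates because g(m) is strictly below the limit of g.
        while not g(m) <= g(k):
            k += 1
        m = k
    return m

def h(n):
    """Non-decreasing subsequence h(n) = g(r(n)) with the same limit as f."""
    return g(r(n))

print([r(n) for n in range(6)])
print([str(h(n)) for n in range(6)])
```

The strict inequality $g(n) < R$ is what guarantees that each inner search halts; with $f$ itself in place of $g$, a term equal to the limit could make a search run forever.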